The Single Source Two Terminal Network with 

Network Coding 
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Abstract — We consider a communication network with a single 
source that has a set of messages and two terminals where each 
terminal is interested in an arbitrary subset of messages at the 
source. A tight capacity region for this problem is demonstrated. 
We show by a simple graph-theoretic procedure that any such 
problem can be solved by performing network coding on the 
subset of messages that are requested by both the terminals and 
that routing is sufficient for transferring the remaining messages. 



I. Introduction 

The seminal work of Ahlswede et al. [1] established that 
for the single-source multiple-terminal multicast problem the 
achievable rate was the minimum of the maximum flows to 
each terminal from the source. They showed that in general, 
it is necessary to perform network coding to achieve this 
capacity. The basic idea is to give the nodes in the network 
the flexibility of performing operations on the data rather 
than simply replicating and/or forwarding it. Li et al. [2] 
showed that linear network coding is sufficient for achieving 
the capacity of the transmission of a single source to multiple 
terminals. Subsequent work by Koetter and Medard [3] and 
Jaggi et al. [4] presented constructions of linear multicast 
network codes. A randomized construction of multicast codes 
was demonstrated by Ho et al. [5]. 

It is important to realize that the multicast capacity result 
of [1] assumes that all the terminals are interested in the 
same data. The general network coding problem with multiple 
sources and terminals and an arbitrary set of connections is 
much harder and not much is known about it. In fact it has 
been shown in [6] that non-linear network codes are necessary 
in certain non-multicast problems. Network coding has also 
been considered from a lossless compression point of view in 
[7][8][9][10]. 

In this paper we study a specific example of a non-multicast 
problem with a single source and two sinks. We find a 
tight capacity region for this problem. This problem was 
independently considered by Ngai and Yeung [11] and Erez 
and Feder [121^. However our method of proof is very different 
and is based on a simple graph-theoretic procedure that may 
be of independent interest. This procedure was also utilized in 
[10]. 

'We became aware of this work after the submission of the current paper. 
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II. Problem Formulation 

Consider a communication network modelled as a directed 
graph G, with a specified source node S and two terminal 
nodes Ti and T2. We assume that the links are noiseless and 
that each edge in G has unit capacity. This assumption can be 
realized by picking a suitably large time unit, assuming suf- 
ficient error-correction at the lower layers of the network and 
splitting edges of higher capacity into parallel unit capacity 
edges. 

Suppose that the source node S observes three independent 
processes A"o,Xi and X2 such that terminal Ti is interested 
in (Xq, Xi) and terminal T2 is interested in (A"o, X2). Let the 
entropy rates of the processes be Hq, Hi and H2 respectively. 
We show the necessary and sufficient conditions for the 
feasibility of this connection. Furthermore it is shown that this 
problem can be solved by a combination of pure routing and 
network coding, where the sources Xi and X2 can be simply 
routed to Ti and T2 whereas the source may need network 
coding. The case of connections between terminal nodes is 
handled more naturally in our framework as compared to [11]. 

In the sequel the capacity assignment to an edge a ^ 6 is 
denoted by cap [a b) and the minimum cut between nodes 
Vi and V2 is denoted by min-cut{Vi,V2). By the max-flow 
min-cut theorem [13], the minimum cut is also the maximum 
rate that can be transmitted from Vi to V2. By a solution to a 
given problem we mean an assignment of appropriate coding 
vectors to each edge so that the required network connection 
can be supported. 

III. Results 

The following theorem is the main result of this paper 
Theorem 1: Consider a communication network modelled 
by a directed graph G — (V, E) with one source node S and 
two terminal nodes Ti and T2. Three independent processes 
Xq,Xi and X2 are observed at S such that H{Xq) = 
Hq, H{Xi) = Hi and H{X2) = H2. Ti is interested in 
receiving (Xq, Xi) and T2 is interested in receiving {Xq, X2). 
If 

min-cut{S,Ti) > Hq + Hi, (1) 
rain-cut{S,T2) > Hq + H2 and, (2) 
rmn-cut{S,{Ti,T2)) > Hq + Hi + H2 (3) 




Fig. 1. The figure shows the augmented graph Gi. The original graph 
G comprises of S, T\ , T2 and the network. The augmented graph Gi also 
contains the virtual terminals T[ and Tj and the nodes Yi and Y2 . The virtual 
edges are denoted by dashed lines and their capacities are labelled. 

there exists a solution where Xi can be routed to Ti, X2 
can be routed to T2 and Xq can be sent to both Ti and 
T2 via network coding. Conversely if any of the inequalities 
(17} - are violated then the connection cannot be supported. 

We defer the proof of this theorem until we have established 
a lemma that is required. We start by defining an augmented 
graph Gi = {Vi,Ei) as depicted in Fig. [l] 

1) The new vertex set is Fi = F U {r{, T^, Yi, ^2} as 
shown in Fig. [U T{ and can be regarded as virtual 
terminals, where the data is actually decoded. Yi and 
Y2 are virtual nodes introduced for the purposes of our 
proof. 

2) The capacity assignments of the new edges are 

cap (Ti ^ r;) ^ Ho + H,,cap {Ti ^ Y,) = 
Ho + H^,cap {Ti ^ Y2) = H^,cap (T^ ^ T^) = 
Ho + H2, cap {T^ ^ Fi) = H2 and cap (T^ ^ Y2) = 

Ho + i?2- 

Lemma 1: For the augmented graph Gi the following is 
true :- 



min-cut{S, Ti) 


= Ho- 


hi?i 




(4) 


min-cut{S, Tj) 


= Ho- 






(5) 


min-cut{S, {Ti, Tj)) 


> Ho- 


hi?l - 


|-i?2 


(6) 


min-cut{S, Yi) 


= Ho- 




^H2 


(7) 


min-cut{S, I2) 


— Ho- 


hi?l - 


Fi?2 


(8) 



Proof:- The first two equalities are obviously true. To see that 
min-cut{S, Yi) ^ Ho + Hi + H2 note that all cuts between 



S and Yi can be divided into four types: 

a) The cut (G, G^) such that S,Ti,T2 € G and Yi G G^ 
By inspection such a cut has capacity larger than or equal 

to H0 + H1+ H2. 

b) S,Ti e G and r2, Yi e G^ 

The 'min-cut{S, T2) > Hq + H2 and min-cut{Ti, Yi) = 
Ho + Hi and the edges connecting Ti and Yi are 
independent of the edges connecting S and Yi. This 
means that such a cut has capacity at least 2Ho+Hi+H2. 

c) S',T2 e G and Ti,Yi e G^ 

The min-cut{S, Ti) > Ho + Hi and min-cut{T2, Yi) = 
H2 and the edges connecting S to Ti are independent of 
the edges connecting T2 to Yi. This means that such a 
cut has capacity at least Ho + Hi + H2. 

d) S eC and Ti,T2,Yi e G^ 

Since the min-cut{S, (Ti, T2)) > Ho + Hi + H2, there- 
fore any such cut has capacity at least Ho + Hi + H2. 

Finally, the sum of the capacities on the incoming edges of Yi 
is exactly Ho + Hi + H2. This means that min-cut{S, Yi) = 
Ho + Hi + Hz- The other statements in the lemma can be 
shown to be true in a similar manner. ■ 

Using the augmented graph Gi we shall now demonstrate 
the existence of a certain number of paths from S to Ti and S 
to T2 over which data can be routed. Further, we shall show 
that it is possible to send the remaining data via network 
coding such that the demands of each sink are satisfied. The 
arguments proceed by utilizing the minimum cut conditions 
and performing a simple graph-theoretic procedure on the 
chosen paths in Gi. The details are given below. 

Proof of Theorem [T] :- 

First let us consider the paths from S to Yi and S to T2. 
Using Menger's theorem (see the book by van Lint & Wilson 
[13]) we can conclude that : 

> There exists a set of {Hq + Hi+ H2) edge-disjoint paths 
from S to Yi from Q. We call this set Q. 

m There exists a set of {Ho + H2) edge-disjoint paths from 
S to TI, from ©. We call this set TI. 

Now, we color the edges in paths G Q, green and the edges 
in paths G TZ, red. At the end of this procedure some edges 
on these paths may have just one color while others may have 
two. 

We claim that it is always possible to find Hi exclusively 
green paths (i.e. paths that contain edges only having the 
color green) from S to Ti. The technique of proof is similar 
to the one used in [10][14]. To prove this we define an 
algorithm A that shall be applied to a path P ^ Q. 

Algorithm A (P) :- 

1) Traverse P starting at S and find the first edge ei that 
has color (green, red) 

2) If no such ei is found then STOP. 

3) ELSE 

Suppose ei G P' where P' e TZ such that P' = Pi - 



2 



ei — P2 where P{ is the portion of P' from S* to ei and 
P2 is the portion of P' from ei to Color all edges 
on P from S" to ei, red in addition to their current color 
and remove red from the edges in P[. We now define a 
condition that each path P £ Q needs to satisfy. 



Cond{P) = {All edges in P are green} 

or {First edge of P is (green, red)} 



(9) 



We continue applying A to each path of Q until all paths 
in Q satisfy Cond. It is easy to see that A will eventually 
halt (for a proof see [10]). 

At the end of this process we realize that there exist Hi 
paths belonging to Q that are exclusively green. This 
is true since if Algorithm A re-routes a path G 7?. it 
removes the color red from one outgoing edge of S and 
places it on another outgoing edge. Therefore the total 
number of outgoing edges that are colored red remains 
constant at Hq + H2. It follows that Hq + Hi + H2 - 
[Hq + H2) = Hi outgoing edges are colored green and 
since the paths obey Cond all those paths are exclusively 
green. 

Next we note that all the exclusively green paths need 
to pass through T[ since T2 has exactly (Hq + H2) 
incoming edges all of which have to be colored red. 
This proves the claim made above. 
The critical point to be realized is that the re-routing of paths 
as above gives us Hi paths from S to T[ that are interference- 
free since these paths do not intersect with the paths from S to 
Tj. This means that data on these paths can be simply routed. 
Applying exactly the same procedure on the set of paths from 
S to Y2 and S to T{ gives us H2 paths from S to that are 
interference-free. 

Now suppose that these paths (Hi paths from S to Ti and 
H2 paths from S to T2) are removed from Gi to obtain a new 
graph G2. Note that there still exist Hq paths from S to T[ 
and Hq paths from S to in G2. In other words, even after 
the removal of the interference-free paths the maximum flow 
from S to T{ and S to T2 in G2 is Hq. Using the multicast 
result of [1] we can surely transmit the same Hq bits from S 
to r{ and T2 via network coding. 

Thus, the entire solution can be realized by an appropriate 
choice of paths such that, 

1) Hi bits (process Xi) can be routed from S to T[ and 
H2 bits (process X2) can be routed from S to 

2) Hq bits (process Xq) can be sent to both T{ and by 
linear network coding [2]. 

Finally we note that it is trivial to reahze the virtual terminals 
T{ and T2 at the terminals. 

The proof of the converse is easy to see since even if one of 
the inequalities ([T]i - (|3]l is violated then at least one terminal 
does not have enough capacity to support its demand. This 
completes the proof of Theorem [T] ■ 

It is possible to find networks where one needs to strictly 
perform network coding for transmitting Xq (while routing 
Xi and X2) and hence our result is tight. A simple example 




Fig. 2. The sources observed at S ai'e such that Hq = 2, Hi = H2 = 1. 
The figure shows a networlc where it is necessary to send Xq via network 
coding. All links have unit capacity. 



that demonstrates this is provided in Fig. |2] Here we have 
i/o = 2 and i?i = i?2 = 1- In Fig. |2] note that the 
min-cut{S, {Ti,T2)) = 4. Therefore among the outgoing 
links from S namely 1 ^ 6, 1 — ^ 2, 1 ^ 3, 1 — > 7, one 
link needs to carry Xi, one link needs to carry X2 and the 
remaining two links can carry a combination of the bits from 
Xq. By the rate requirements at the terminal it is easy to see 
that the combination of the Xq's needs to be carried on links 
1^2 and 1^3. This means that the solution needs to 
be realized by routing Xi on link 1^6, routing X2 on link 
1^7 and using the remaining part of the network to transmit 
Xq. However the remaining part of the network is precisely the 
celebrated butterfly example of [ 1 ] and we know that network 
coding is essential for transmitting Xq over it. 

IV. Conclusion 

We found the capacity region for a network information 
transfer problem with a single source and two terminals when 
the use of network coding is permitted by utilizing a simple 
graph-theoretic procedure that may be of independent interest. 
It is interesting to note that the use of network coding permits 
us to obtain a tight characterization of the capacity region of 
this problem. However the region for the general broadcast 
channel with two receivers is still unknown (this was also 
noted by [12]). 

V. Acknowledgement 

This work was supported by the University of California 
and Texas Instruments through UC Discovery grant COM04- 
10155 and National Science Foundation grant CCR-0209110. 

References 

[1] R. Ahlswede, N. Cai, S.-Y. Li, and R. W. Yeung, "Network Information 
Flow," IEEE Trans, on Info. Tk, vol. 46, no. 4, pp. 1204-1216, 2000. 

[2] S.-Y. Li, R. W. Yeung, and N. Cai, "Linear Network Coding," IEEE 
Trans, on Info. Tk, vol. 49, no. 2, pp. 371-381, 2003. 

[3] R. Koetter and M. Medard. "Beyond Routing: An Algebraic Approach 
to Network Coding," in IEEE Infocom, 2002. 



3 



[4] S. Jaggi, P. Sanders, P. A. Chou, M. Effros, S. Egner, and L. Tolhuizen, 

"Polynomial time algorithms for multicast network code construction," 

Submitted to IEEE Trans, on Info. Th. 
[5] T. Ho, R. Koetter, M. Medard, M. Effros, J. Shi, and D. Karger, "Towards 

a Random Operation of Networks," Submitted to IEEE Trans, on Info. 

Th. 

[6] R. Dougherty, C. Freiling, and K. Zeger, "Insufficiency of Linear Coding 
in Network Information Flow," Submitted to IEEE Trans, on Info. Th. 
[7] M. Effros, M. Medard, T. Ho, S. Ray, D. Karger, and R. Koetter, 
"Linear Network Codes: A Unified Framework for Source, Channel and 
Network Coding," in Proceedings of the DIMACS Workshop on Network 
Information Theory, 2003. 
[8] T. Ho, M. Medard, M. Effros, and R. Koetter, "Network Coding for 

Correlated Sources," in CISS, 2004. 
[9] A. Ramamoorthy, K. Iain, P. A. Chou, and M. Effros, "Separating 
Distributed Source Coding from Network Coding," in 42nd Allerton 
Conference on Communication, Control, and Computing, 2004. 

[10] , "Separating Distributed Source Coding from Network Coding," 

Submitted to IEEE Trans, on Info. Th. 

[11] C. K. Ngai and R. W. Yeung, "MultiSource Network Coding with Two 
Sinks," in IEEE ICCCAS, 2004. 

[12] E. Erez and M. Feder, "Capacity Region and Network Codes for Two 
Receivers Multicast with Private and Common Data," in Workshop on 
Coding, Cryptography and Combinatorics, 2003. 

[13] J. H. van Lint and R. M. Wilson, A Course in Combinatorics. Cam- 
bridge University Press, 2001. 

[14] K. Iain, M. Mahdian, and M. R. Salavatipour, "Packing Steiner Trees," 
in SODA, 2003. 



4 



